Multidimensional Anlaysis of XML Document Contents with OLAP Dimensions
نویسندگان
چکیده
With the emergence of Semi-structured data format (such as XML), the storage of documents in centralised facilities appeared as a natural adaptation of data warehousing technology. Nowadays, OLAP (On-Line Analytical Processing) systems face growing non-numeric data. This chapter presents a framework for the multidimensional analysis of textual data in an OLAP sense. Document structure, metadata, and contents are converted into subjects of analysis (facts) and analysis axes (dimensions) within an adapted conceptual multidimensional schema. This schema represents the concepts that a decision maker will be able to manipulate in order to express his analyses. This allows greater multidimensional analysis possibilities as a user may gain insight within a collection of documents.
منابع مشابه
A Conceptual Model for Multidimensional Analysis of Documents
Data warehousing and OLAP are mainly used for the analysis of transactional data. Nowadays, with the evolution of Internet, and the development of semi-structured data exchange format (such as XML), it is possible to consider entire fragments of data such as documents as analysis sources. As a consequence, an adapted multidimensional analysis framework needs to be provided. In this paper, we in...
متن کاملEfficient Compression and Storage of XML OLAP Cubes
In this paper, the authors present an approach to efficiently compress XML OLAP cubes. They propose a multidimensional snowflake schema of the cube as the basic physical configuration. The cube is then composed of one XML fact document and as many XML documents as the dimension hierarchy members. The basic configuration is reorganized into two ways by adding data redundancy on purpose in order ...
متن کاملA New Multidimensional Model for the Olap of Documents Based on Facets
The OLAP (On-Line Analytical Processing) systems provide a multidimensional analysis of voluminous databases by generating a synthetic vision of data. Several studies have focused on the application of these OLAP techniques on structured and semi-structured data, and more specifically XML documents. In this context, we propose a new multidimensional model for the OLAP of XML documents. The prop...
متن کاملXML-OLAP: A Multidimensional Analysis Framework for XML Warehouses
Recently, a large number of XML documents are available on the Internet. This trend motivated many researchers to analyze them multi-dimensionally in the same way as relational data. In this paper, we propose a new framework for multidimensional analysis of XML documents, which we call XML-OLAP. We base XML-OLAP on XML warehouses where every fact data as well as dimension data are stored as XML...
متن کاملXldm: an Xlink-based Multidimensional Metamodel
The growth of data available on the Internet and the improvement of ways to handle them consist of an important issue while designing a data model. In this context, XML provides the necessary formalism to establish a standard to represent and exchange data. Since the technologies of data warehouse are often used for data analysis, it is necessary to define a cube model data to XML. However, dat...
متن کامل